Making the Release of Confidential Data from Multi-Way Tables Count
نویسندگان
چکیده
tatistical disclosure limitation (SDL) and confidentiality have often been shrouded with a nonstatisti-cal veil and the methodology for protecting confidential data has produced problematic outcomes for research data users. Here we describe one possible statistical approach to SDL for data in the form of multidimensional contingency tables that illustrates the following points: • For categorical data the traditional form of reporting has been marginal tables and conditionals. • Releasing such partial information is compatible with and useful for statistical methods for log-linear models and directed acyclic graphs. • Interesting new research problems arise in this area. From the early part of the 20 th century , confidentiality has been an important element of the mantra of statistical agencies, and it became embedded in the culture of the U.S. Census Bureau with the protections associated with the 1929 Census Act (now known as Title 13 of the U.S. Code). But the term confidentiality was always thought about in terms of the protection of individual and establishment data and not the release of data to policy makers, researchers, and the public. Moreover, confidentiality represented for many an " absolute " concept and it was not until the 1970s that there was movement toward making statistical thinking more central to the operational implementation of confidentiality protection. More specifically, the President's Commission on Federal Statistics, issued in 1971, placed special emphasis on confidentiality and subsequently , when the Office of Management and Budget's Statistical Policy office created the Federal Committee on Statistical Methodology, one of its first activities was a study of confidentiality and statistical disclosure protection (Working Paper No. 2). Working paper No. 2 was of special interest in part because it signaled for the first time the importance of the The confluence of research on disclosure methods and of research on log-linear model theory uncover the essential elements needed by analysts working with discrete data. trade-off between access and confidentiality and presented Tore Dalenius's probabilistic notion of disclosure: " If the release of the statistics S makes it possible to determine the value [of confidential statistical data] more accurately than is possible without access to S, a disclosure has taken place. " [Working Paper No. 2, pp. 7 and 9] The past 25 years have seen the growth of disclosure limitation as a statistical subdiscipline, and the term itself which was a change from that used in the 1970s, recognized that the …
منابع مشابه
Optimal Tabular Releases from Confidential Data
We describe and illustrate NISS-developed optimal tabular release technology, which releases sets of sub-tables of large contingency tables that maximize data utility (in our examples, the number of sub-tables released) subject to a constraint on disclosure risk (tightness of bounds on small-count, risky cells in the underlying table). This approach explicitly accommodates the mandate of Federa...
متن کاملPartial Information Releases for Confidential Contingency Table Entries: Present and Future Research Efforts
Tabular data have been a staple product for disseminating information derived from the confidential microdata that fuel social science research and inform policy decisions. This paper outlines recent results on disclosure risk assessment associated with the release of high-dimensional contingency tables, and discusses some related research problems. The main focus is the partial information rel...
متن کاملLoading of Gentamicin Sulfate into Poly (Lactic-Co-Glycolic Acid) Biodegradable Microspheres
Objective: In dental treatments, use of carriers for targeted antibiotic delivery would be optimal to efficiently decrease microbial count. In this study, gentamicin was loaded into polylactic co-glycolic acid (PLGA) microspheres and its release pattern was evaluated for 20 days. Methods: In this experimental study, PLGA microspheres loaded with gentamycin were produced by the W/O/W method....
متن کاملRounding methods for protecting EU-aggregates
In the European Statistical System the statistical information is collected by the National Statistical Institutes (NSIs). The NSIs produce aggregate tables at the national level. They are also responsible for proper protection of these tables and hence they have to keep certain cells confidential, suppressing them from publications. Eurostat produces statistical information at the EU-level. Ho...
متن کاملPartial Association Components in Multi-way Contingency Tables and Their Statistiical Analysis
In analyses of contingency tables made up of categorical variables, the study of relationship between the variables is usually the major objective. So far, many association measures and association models have been used to measure the association structure present in the table. Although the association measures merely determine the degree of strength of association between the study varia...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004